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Abstract We introduce a statistical mechanics formalism for the study of constrained graph evolution 
as a Markovian stochastic process, in analogy with that available for spin systems, deriving its basic 
properties and highlighting the role of the 'mobility' (the number of allowed moves for any given 
graph). As an application of the general theory we analyze the properties of degree-preserving Markov 
chains based on elementary edge switchings. We give an exact yet simple formula for the mobility in 
terms of the graph's adjacency matrix and its spectrum. This formula allows us to define acceptance 
probabilities for edge switchings, such that the Markov chains become controlled Glauber-type detailed 
balance processes, designed to evolve to any required invariant measure (representing the asymptotic 
frequencies with which the allowed graphs are visited during the process). As a corollary we also derive 
a condition in terms of simple degree statistics, sufficient to guarantee that, in the limit where the 
number of nodes diverges, even for state-independent acceptance probabilities of proposed moves the 
invariant measure of the process will be uniform. We test our theory on synthetic graphs and on 
realistic larger graphs as studied in cellular biology. 
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1 Introduction 

The aim of tliis paper is to develop a matliematical framework for the study of stochastically evolving 
graphs, defined in terms of generic Markov chains that describe constrained edge re-wiring, and to 
analyze within this framework the Markovian edge swap dynamics as the simplest non-trivial example 
of a constrained stochastic graph dynamics. 

The formalism which we present can be seen as the analogue of the one available for interacting 
spin systems. In the latter case, the elementary move is typically a single spin flip, and constrained 
dynamics have been considered, e.g. by simultaneously flipping pairs of oppositely oriented spins to 
preserve the total magnetization [1,2]. In extending this theory to evolving graphs we are motivated by 
the necessity to bridge an existing gap between the static and the dynamical treatment of graphs. While 
many statistical aspects of random graph topology are now understood (like the influence of topology 
on processes occurring on graphs, percolation and critical phenomena, loop statistics, or the entropies 
of different topologies in various random graph ensembles [3-10]), much less work has been invested 
in the mathematical study of the dynamics of graphical structures (see [11 13] for recent examples). 
Besides their mathematical interest, dynamical problems arc prominent in application areas where the 
issue of sampling uniformly the space of graphs with certain prescribed macroscopic properties is vital. 
Multiple examples of these are found in systems biology, where different graph randomization protocols 
have for instance been tested and used to identify the elementary bricks ('motifs') characterizing such 
networks, such as the transcriptional regulation network of the bacterium E. coli [14-16]. Many other 
examples can be found in economics, ecology and the social sciences, see e.g. [17,18]. At the level 
where the constraints involve only the simplest quantities, i.e. the degree sequence, even generating 
such graphs is known to be a non-trivial problem that has produced much inspiring work and some hard 
open questions [19-22]. It is known, for instance, that a Markov chain based on degree-preserving edge 
swaps ('switchings') starting from a given graph does not generically produce a uniform sampling of the 
space of graphs with the same degree sequence, and heuristics with various degrees of sophistication 
and effectiveness have been employed to restore the uniform measure [23 31]. When more complicated 
observables than the degree sequence are involved (like the degree correlations or the number of loops 
of a given length through each node) the situation rapidly becomes more difficult, both mathematically 
and numerically. 

Here we adopt a top-down approach, and treat the problem of constrained Markovian graph evo- 
lution first as generally as possible before dealing with specific instances. Markov chains describing 
Glauber-type controlled equilibrium graph dynamics for generic invcrtible elementary moves are con- 
structed and analyzed in Section 2, including their equilibration properties. Section 3 then focuses on a 
particular class of moves, namely degree-preserving edge swaps, both in view of their wide application 
in computer studies and because they are the simplest degree-constrained moves. Among other results, 
we establish a simple formula for the number of allowed switchings from a given graph configuration 
(the graph's 'mobility') and quantify the dynamical relevance of mobility-related entropic effects. This 
formula allows us to derive the correct acceptance probabilities for randomly drawn candidate edge 
swaps that guarantee degree-constrained stochastic evolution towards any desired stationary measure, 
as well as a condition on degree statistics sufficient to ensure that for uniform acceptances of random 
edge-swaps the asymptotic measure over the space of allowed graphs generated by the dynamics be- 
comes uniform (implying abscnc;c> of mobility-related entropic efli'ccts). Our results are illustrated and 
validated via application to various synthetic and biological graphs. 



2 Formalism and general properties of controlled Mgirkovian graph dynamics 

2.1 Basic definitions 

We study graphs consisting of N nodes (labeled by Roman indices i = 1 . . . A^) that can be linked by 
undirected bonds. A graph is defined microscopically by its adjacency matrix c = {c^}, where = 1 
if and only if nodes i and j are connected and Cij = otherwise, and it is assumed that Cij = Cjj and 
Cii = for all We denote the discrete set of all such graphs as C = {0, Our aim is 

to define and study constrained Markov chains for the evolution of c in some subspace i? C C, i.e. 
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discrete-time stochastic processes for the probabihty pt (c) of observing graph c at time t of the type 

Veen : pt+i{c)=Y,W{c\c')pt{c') (1) 

with t e IN, W{c\c') > Vc,c' e n, and J2cei2^i^\^') = 1 Vc' e f2. Here the quantity W{c\c') 
denotes the single-step transition probabihty from graph c' to graph c. We focus on processes of the 
form ^ that have the foUowing additional properties: 

(i) The process ([T]) allows only for a given limited set of elementary moves F : flp f2, which are 
constrained, in that each F can act only on a subset Hp C /2 of all possible graphs. 

(ii) The process ([T]) converges to the invariant measure Poo(c) = Z^^e^-^^^^^ on ]7, in which H is a 
prescribed function, for any choice of initial conditions po{c). Here Z = X^cer? ^'^ ensure 
normalization. 

For each elementary move F G <P we define an indicator function /f(c) G {0, 1}, where If(c) = 1 if 
and only if the move c — > Fc is allowed. In addition to the above requirements we will also demand 
that all elementary moves are invertible, i.e. 

(iii) For each F € there exists a unique F^^ € such that FF~^ — F~^F = I. Both F and F~^ are 
taken to act on the same subset of states, i.e. /_f(c) — Ip-i{c) for all c E f2. 

Processes of the form ([T|) can be used to generate graphs with controlled properties, such as built-in 
constraints (which can be encoded in the subset f2 of allowed graphs, and induce the limitations on 
the applicability of moves that generate the subsets ^2p) and specific statistical weights (which can be 
encoded in the invariant measure poo(c), i.e. in the function H{c)). The scenario of having a limited 
set of elementary moves, which each can act only on certain configurations c, describes many of the 
commonly studied dynamical processes for graphs. In the examples that will be worked out explicitly 
in subsequent sections, ^ is the set of elementary moves that preserve the degree ki{c) = '^jCij of 
every node, the simplest possible such move F being an edge-swap between two pairs of nodes. 

Our next task is, for any given subset C C of states, any given set of possible moves with 
state-dependent application constraints that meet conditions (i,iii), and any given measure Poc{c) of 
the form (ii), to construct appropriate transition probabilities VK(c|c') such that the state probabilities 
Pt(c) generated by the Markov chain ^ are guaranteed to converge to the desired values poo(c). 



2.2 The Markov chain transition probabilities 

In order to construct the transition probabilities I4^(c|c'), we can resort to the familiar ideas behind 
Monte-Carlo (or Glauber-type) processes for the simulation of physical systems, provided these are 
properly adapted to build in the constraints on the applicability of the allowed graph transitions 
i*" g In particular, we will construct our transition probabilities such that the corresponding process 
obeys detailed balance, i.e. 

Vc,c'el2 : VF(c|c')Poo(c') = VK(c'|c)poo(c) (2) 

Summation over c' e in ([2]) reveals in the usual manner that ([2]) implies stationarity of Poo, i-e. 
Vc e i? : ^^ii^QW{c\c')poa{c') — Poo(c). We next define a new set of moves which excludes the 
identity operation: 

<P' ^ {F G<P\3cG Q such that Fc ^ c) (3) 
A generic detailed balance Markov chain is then obtained by choosing 

W^(c|c') ^ W,Fc'A{Fc'\c') + 5e,c' [1 - A{Fc'\c')]\ (4) 



The rationale and interpretation of this choice (j4|) is as follows. At each step a candidate move F e 
is drawn with some probability g(F|c'), where c' denotes the current state. This move is accepted 
(and the transition c' ^ c = Fc' is executed) with some probability A(Fc'|c') e [0, 1], which depends 
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on both the current state c' and on the proposed new state Fc' . If the move is rejected, which 
happens with probabihty 1 — A(Fc'|c'), the system remains in the current state c'. Clearly (j4|) obeys 
Tliceo W^(c|c') = 1 for all c' S J7, as it should. Working out the detailed balance condition ([2]), upon 
writing the equilibrium state in the Boltzmann form Poo(c) = Z"^ exp[— 77(c)], leads to the following 
conditions for q{F\c) and A(c|c'): 

(Vcer2)(VFe^'') : q{F\c)A{Fc\c)e-"^''^ = q{F^^\Fc)A{c\Fc)e-"^'"''^ (5) 

Now let n{c) denote the number of moves that can act on a state c (to which we shall refer as the 
'mobility' of state c), defined as 

n(c) = ^^(^)- (6) 

The fact that for constrained moves these numbers 7t,(c) are generally state-dependent, forces us to 
choose Monte-Carlo acceptance probabilities A(c|c') that no longer depend on H{c) — H{c') only. If 
the candidate moves F are drawn randomly and with equal probabilities from those that are allowed 
to act, we have q{F\c) = Ip{c)/n{c) and the detailed balance condition becomes 

(Vcel2)(VFe^>') : A{Fc\c)e-"^''^ /n{c) ^ A{c\Fc)e-"^''''^ /n{Fc) (7) 

It is then clear that having a state-dependent n(c) is equivalent to modifying the state energies, viz. 
H{c) H{c) + logn(c), so that the Monte-Carlo acceptance rates can be chosen as 

(note that move reversibility implies n{c) > 1) and the end result is the Markov chain defined by the 
transition probabilities 



n(c') i „(c')e-5[«(^''=')--ff(c')] +n(Fc')e^[-f^(-P^'^')--f^('=')l J 

One easily confirms by direct substitution that the transition probabilities (O indeed define a Markov 
process which leaves the measure Poo(c) = 2^^^ exp[— iJ(c)] invariant, since for any c G we find, 
using simple identities such as If{c) — Ip-^ (c) = If{Fc) and J^Fe'P' ^(^) = X^fg*' ^(^^^)' ^^^^ 

^ I¥(c|c')Poo(c') = I ^ I^(c|c')e-^('^') 

^'^t^ /f(c') ^<5J.-lc.c'f^(c')e~5[«('=')-^(^^-')l +4,,,n(Fc')e3[^(^''=')-^f(c')] 



e 



If{c) J n(c)e 



^"^''^fS \n{F-^c)e-'^^H(c)-H{F-^.)]^^^^y^[mc)~H(F-^c)] 



n(Fc)e2 



i[H(Fc)-H(c)] 



^-^ n(c) 



5 



2.3 Master equation representation of the process 



The process defined by ()1I9|) allows for relatively easy numerical implementation, but for mathematical 
analysis a real-time formulation in the form of a master equation is more convenient. The formal method 
to go from a process of the form ([T]) to a master equation, is to assume that the duration of each of 
the discrete iteration steps in ([T]) is a continuous random number [32] . The statistics of these random 
durations are defined by the probability Wm {t) that at time t>Q precisely m iteration steps have been 
made. Our new real-time process is now described by 

Pt{c) = V 7r^{t)prn{c) = V V ( W^" ) (c| c')po (c') (11) 



m>0 m>0 e'en 



where the time t has now become a continuous variable. For TTm(t) we make the Poissonnian choice 
T^mit) = (t/T)™e^*/'^/TO!, with the properties 

^7r„i>o(t) = [7r„i_i(i) - TTmit)] , ^^'^o^*) ^ -T"^7r,„(i) (12) 

From (m)7r = i/r it follows that r is the average duration of a single discrete iteration step. Our choice 
for TTm{t) allows us to write for the time derivative of Pt{c): 

^^^*(^) = E '^"-i W E iWmc\c')po{c') ~ J2 '^"^ W E (W^")(c|c')Po(c') (13) 

m>o e'en m>o e'en 

= -Pt{c) + J2 Wic\c')Ptic') (14) 
e'en 

which has the form of a master equation. Applying this procedure to our present Markov chain pi9p 
gives its corresponding master equation which (after re-arranging of terms) takes the transparent form 

d , , . ^ f 1 n(Fc)e^[^(^'=)-f^('^)lPt(Fc) 



'i* f^' ' ' |n(Fc) n(c)e-i[^(^'=)--f^('^)l +n(Fc)ei[«(^'^)-ff('=)l 

1 n(c)e-^[^(^'=)-^('^)lPt(c) 



n{c) n(c)e-^[^(^'=)-^('^)l -Hn(Fc)e^[^(^'=)-^('^)] 



E^Hc){ 



Fe<P' 



where, using the the short-hand AfU{c) = U{Fc) — U{c), we have identified the transition rates 
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For N —^ 00 there can be no difference between the process (|15p . describing random durations of the 
steps of the Markov chain, and one where the duration of each step would be r (rather than just 
their average). This follows from the moments of the Poisson process, viz. {m?)-,^ = -f (m)7r, 

which guarantee that for finite real time t the relative uncertainty in the number of iterations will 
vanish according to (rn?-)-K — (™) J/ — 0{N^^^'^). If we were to repeat the above derivation for 
a process where each randomly chosen and possible edge swap was always accepted, we would again 
find equation (fT5|) . but now with the trivial transition rates Wp{c) — 1 for all c e /2 and all F E (l>' . 

From (|15ll6p one can proceed to derive convenient dynamical equations for ensemble averages. 
These are particularly compact and transparent for those situations where ~ F for all F G <P' , 
which will, for instance, be true for edge swap graph dynamics. To be specific, upon defining (/(c)) = 
J2een Pt{c)f{c) for any arbitrary observable /(c), we obtain, using identities such as If{Fc) — /_f(c), 
AfU{Fc) ^ -AfU{c) and wj.{Fc) = Wp{c) (which all follow directly from p-'^ = F): 

d 

^Tt 



d7(/(^))= E (^{/(^^c)u;+(^^c)-/(c)^^(c)}) 
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2.4 Convergence towards equilibrium 



Convergence of (|15ll6p towards the equilibrium state generally implies that the same is true for the 
underlying Markov chain (|1I9[P ]. The physically most intuitive convergence proofs are based on con- 
structing a Lyapunov function F{t). Here we choose for F{t) the KuUback-Leibler divergence between 
the equilibrium state Poo(c) = exp[— iJ(c)] and the instantaneous distribution Pt{c), which obeys 
F{t) > for all t by definition, and is zero only if the measures Pt(c) and Pcx)(c) are identical: 



F{t) = ^Pt(c)log[Pt(c)/poo(c)] =logZ + ^Pt(c)[i/(c)+logPt(c)] 



(18) 



cGJ'2 



cGl7 



The proof that F{t) decreases monotonically is standard and relies only on the detailed balance condi- 
tion ([2]) and on the normalization J2cef2 W{c\c') = 1 Vc' e fi. We repeat it here only for completeness, 
and to show that it can handle the constraints in the graph dynamics in a straightforward manner: 

^P(t) = ^[H(c)+logP,(c)]-|p(c) 



^Y.[H{c)+logP,{c)][Y, W{c\c')Pt{c') - Pt{c 

cef2 e'en 

^Y.iH{c)+logPt{c)] J2 [w^(c|c')Pt(c')-l^(c'|c)P,(c) 

cen e'en 

h E [[^(c) + logPt(c)] - [F(c') + logPt(c')]] [w{c\c')Pt{c') - W^(c'|c)P,(c) 

c,c' Gf2 

^ [[H(c) + logP,(c)]-[H(c') + logP,(c')] 

gH(c')+logA(c')p^(j.|^/)g-H(c') _ gff(c)+logP,(c)^(j,/|^)g-//(c 



2t 



c,c' Gf2 



(19) 



We next use the detailed balance identity ^ and obtain 

1 

'27 



= E W^(c|c')e-^('^')[[i?(c)+logP(c)] - [i/(c') + logP(c') 

c,c' Gi7 



gff(c)+logPt(c) _ ^H(c')+logPt(c') 



< 



(20) 



The last step derives from the general identity (e^ — ey){x ~ y) > for all {x,y), with equality only 
if a:; = y. Since F{t) is bounded from below, it is indeed a Lyapunov function for the process ()15p . 
The distance between Pt(c) and the equilibrium measure Poo (c) = Z~^e~^^'^^ decreases monotonically 
until dF{t)/dt = 0. Inspection of the conditions for equality in (PO)) shows that F{t) stops decreasing 
only if Pt(c) has reached a point P(c) = x(c)e~^''^-' with: 



(Vc,c' e n) : W{c\c') = or x(c) = x(c') 



(21) 



If we define f2c = {c' e i7| 3^ g IN : (II^'^)(c'|c) > 0} (representing the set of all states that 
are accessible from c via repeated iteration of the Markov chain), then for all c' S Qc we must 
have x(c') = x(c). Each solution of ([2T|) generates an invariant measure under the dynamics, since 
substitution of P(c) = x(c)e^'^'''^-' in both sides of p4)) shows, using ([2]), 



^ M^(c|c')x(c')e-^('^') - X(c)e- 



H{c) 



e'en 



-^('^^[^ X(c')mc'lc)-; 

e'en 



e'en 



(22) 



^ Exceptions to this would be e.g. periodic Markov chains. 
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One such stationary solution P(c) is the equihbrium measure, corresponding to = for aU 
c e J7. If our Markov chain is ergodic, i.e. if i7c = J7 for all c € then it is the only such state to 
satisfy (|2ip . and our process must always evolve towards Poo(c) = exp[— i?(c)]. 

We shall next focus our attention on a specific class of moves that preserve the values of all node 
degrees fci(c) = ^ Cy , and work out the details of the corresponding equilibrium dynamics. 



3 Degree-constrained dynamics 



3.1 Elementary moves defined on the basis of 'edge swaps' 

Here we use the results of the previous section to construct stochastic processes for evolving graphs 
which preserve their full degree sequence k(c) — {A:i(c)}, and in addition evolve (within the subspace 
of graphs with fixed degree sequence k) to any desired prescribed measure of the form Poo(c) = 
exp[— _ff (c)]. Let /?[k] denote the discrete set of all un-directed graphs that have the specified 
degree sequence k: 



/2[k] = {c e {0, I fc,(c) = fc, Vi = 1 . . . iV} 



(23) 



This set will play the role of our state space J7 in ([T]). For the elementary moves of our Markov chain 
we choose the so-called 'edge swaps' (simple cases of the more general family of Seidel switches [33]), 
which are the simplest possible graph transitions that conserve the values of all degrees (see Appendix 
E|- An edge swap is defined by the following protocol: 

(a) draw four distinct nodes {i,j,k,£) 

(b) check whether {ca, Cjk, Cij,cu) = (1,1,0,0) 

(c) if yes, invert these four variables: {cu, Cjk, Cij, cm) (0, 0, 1, 1) 

We generate the set <P' of such moves by summing uniquely over all quadruplets of nodes, with validity 
checks for all possible edge swaps that could be carried out for each quadruple. To do this carefully we 
define Q = {{i,j,k,l) G {1, . . . , A^}''! «<j<fc<^}, and we no longer allow for any permutations of the 
four nodes. The six potential edge swaps are then found to be the following, with thick lines indicating 
existing links, and thin lines indicating absent links that will be swapped with the existing ones: 



I 



II 



III 



k 





together with their inverse edge swaps 

IV 
i j 



i k 



V 



i J 

X 



VI 




k i k 



This leads to a natural grouping of edge swaps into the three pairs (I, IV), (II, V), and (III, VI). We 
label all three resulting auto-invertible operations for each ordered quadruple {i,j,k,£) by adding a 
subscript a, so that our set of all auto-invertible edge swaps are from now on written as Fijki-a with 
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i < j < k < £ and a E {1,2,3}. We define suitable associated indicator functions Iijki-a{c) £ {0, 1} 



that detect wlietlier (1) or not (0) the edge swap Fijke-a can act on state c, so 

lijkiaic) = CijCkeil - Cii){l - Cjk) + (1 - Cy)(l - Cki)cuCjk (24) 

Iijki;2{c) = CijCkeil ~ Cik){l - Cjf) + (1 - Cij){l - Cki)cikCje (25) 

Iijkiisic) = CikCjf {l - Cie){l - Cjk) + (1 - Cjfe)(l - Cje)ciiCjk (26) 

If Fijki-a can indeed act, i.e. if Iijke-a{c) = 1, the edge swap will operate as follows: 

Fijki;a{c)qr = 1 ~ C^r for ((?, r) £ Sijkl;a (27) 

Fijki;a{c)qr = Cqr fOT (q, r) ^ Si-jkl-a (28) 

'^^^''^ S^Jki,l = {k,l), (*,£), U,k)} (29) 

S^,ki;2^{{i,j)Ak,i)Ahk),{j,m (30) 

S,,ki;3 = {{i,k),{j,^),{hi),{j,k)} (31) 

Each edge swap is its own inverse, so the property If{c) — Ip-i (c), that we relied upon several times, 
is trivially valid. The link with the theory in the previous section thus becomes, for arbitrary G{F), 

F{c) ^ F,,ki;a{c) (32) 

If{c) Iijkt,a{c) (33) 

Qpic) ^ %fc,;„(c) = {ce{0,l}^^(^-i)| /yM;„(c) = 1} (34) 

Y.G{F) ^ T.G{F^,ki;.) (35) 

Fe<P' i<j<k<l Q!<3 



3.2 The number of possible edge swaps - graph mobility 

Given the above definitions, the number n(c) of possible edge swaps that can act on a given graph c 
(its mobility) is given by the following expression 

3 

n(c) = ^ X! Ii]kl;a{c) (36) 

a=l i<j<k<l 

However, it is possible to obtain a simplified formula for n(c) in terms mainly of the degree and loop 
statistics of c, by exploiting the fact that there are 4! = 24 possible orderings of the four indices 
(i,j, fc,£), which implies that for any fully permutation invariant quantity Fijke one may always write 
(with the short-hand Sij = I ~ Sij): 

Fijke ~ -^'Y^SijSikSuSjkSjiSkiFijke (37) 

i<j<k<i ijkl 

We can use this to write ([55]) . using ca — for all i, in the alternative form 

"(c) = X! \^C.ijCkt{l ~ Cu)(l ~ Cjk) + CitCjki'^ ~ Cij){l - Ckl) 
i<j<k<e 

+ CijCke{l - Cik){l - Cji) + CikCji(l - Cij)(l - Cki) 



+ CikCji{l - cu)(l - Cjk) + cuCjk(^ - Cifc)(l - Cj£)| 



^ 5ik5it5jk5jiCijCkt{l - Cie){l - Cjk) + X/ ^v^ikSjeSkeCieCjkil - Cij){l - cki) 

ijkl ijk£ 

+ T77 Sij'SteSjkSkeCzkCjejl - Cif)(l - Cjk) 
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= -^"^{^ ^ ^ik - - Sjk - Sj£ + SikSje + Sii5jk)cijCk£{l - Cii){l - Cjk) 

ijkl 

+ "^{^ - ^ij - ^tk - - Ski + SijSke + 6ikS-ii)ciiCjk{l - c.ij)(l - Cke) 

ijkl 

+ "^{^ - Si] - Sit - Sjk - Ski + 5ij6kl + Su6jk)CikCji{l - Cu){l - Cjk) 
ijki 

= i X! CyCfe£(l - Cie){l - Cjk) + - - c^fe) 

2j ijk 



where fc^ = J2j ^ij the degree of node i. 

Formula ([55]) plays a key role in the construction of our controlled Markov chair0, since we have 
seen earlier that any dependence of n{c) on the state c that cannot be expressed in terms of the degree 
sequence only will generate entropic preferences of certain graphs over others. In appendix[B]we extract 
from ((38)) the following rigorous bounds for n(c), with fcmax = max^ ki and (fc™) = N^^ fc™: 

n(c) > -^N^{kf + ^N{k) - iiV(fc2)(2fc„,ax + 1) (39) 
n{c) <\N^{k)^ + \N{k) - \N{e) (40) 

Let us test the full expression p8|) and the strength of the two bounds P9l40p against three qualitatively 
different but explicitly verifiable cases: 

— Fully connected graphs Cij = 1 — (5^ : 

Here one should find n{c) =0 (since no edge swaps are possible). The degrees of c are ki ^ N—1 
for all i, and its eigenvalues are A = iV— 1 (with multiplicity 1) and A = — 1 (with multiplicity 
N-1). Hence Tr{c^) = (A^-1)[(A^-1)3 + 1] and Tr(c3) = N{N-l){N-2), and substituti on of t hese 
properties shows that ((38)) indeed gives correctly n(c) = 0. In this example the bounds ((39l40p are 
found to be weak, reducing to \N{N-1)[1-{N~1)^] < n{c) < lN{N-l)[l + {N-l)^]. 

— Periodic chains — (^i.j-i + ^ij+i (mod N), N > 4: 

Direct inspection of the possible edge swaps reveals that one should find n{c) = iV(iV — 4). In 
this ring-type graph ki = 2 for all i, (c^)y = <5ij+3 + S^i^+i + 36ij-i + <5,;.j-3 and (c'*)y = 
+ 4(5ij+2 + Q6ij + 4(5ij_2 + It follows that Tr(c'*) = 6N and Tr{c^) = 0, and our 

formula ((55)) is seen to reproduce n{c) — N{N — A) correctly. The bounds ()39l40p are now rather 
close, giving N{N—^) < n(c) < N{N—^). In fact one obtains the same leading two orders in 
N of this result for n(c) also in regular random graphs with p{k) — Sk,2, where the eigenvalue 
distribution of the adjacency matrix is [34] 

„„, . (41, 

^ The occurrence in (|38p of the terms Tr(c'') and Tr(c'') imply a strong connection between a graph's mobility 
n(c) and its loop statistics, and it might be instructive to pursue this link somewhat further. If we call L„(c) the 
number of loops of length n in c, one finds upon correcting for over- counting, that ^3(0) = | "l^ijk CijCjkCki ~ 
iTr(c^) and that ^4(0) = 1 1],^^^ (1 - Sji){l - &ik)cijCjkCktcii = |Tr(c'') - 1 fc? + | J2i ^i- Hence we may 
write ((38)) alternatively as n(c) = hY - \ kidjkj + 2L4,{c) -j- 3L3(c). 
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here one finds 

n(c) = iV^ - JdX q{X)[X^ + 2X^] + o{N) 

= N + 8N — / +n(N) = N(N-A)+n(N) (42) 

2 Jo vl-a;2 

— Two isolated links: ci2 — C21 = C34 = C43 = 1, with all other dj — 0: 

Here one should get n(c) — 2. The degrees of c are ki = k2 — — — 1, with ki ~ for 
i > 4. The eigenvalues are A = ±1 (each with multiplicity 2) and A = (with multiplicity iV— 4), so 
Tr(c'') — 4 and Tr(c'^) = 0. Again our formula p8|) is confirmed to be correct, reducing to n(c) = 2. 
The bounds p9l40p now give 2 < n{c) < 4; here the lower bound is satisfied with equality. 

It is not generally possible to simplify our formula psp for n(c) further in terms of the degree sequence 
only (had it been possible, n(c) would have dropped out of the transition probabilities ([9])), as this 
would require that the third and fourth moments of the eigenvalue distribution g{X) of any graph can 
be written in terms of its degree sequence. Indeed, below we will discuss several examples of graphs 
with the same degree sequence that have different mobilities. 



3.3 Markov chain transition probabilities for edge swaps 



Our next task is to simplify the general expression ^ for the transition probabilities of the appropriate 
Markov chain, for the case where the set of moves <P' is defined as above. We have 



w{c\c')^ E E 

i<j<k<l a<3 



Iijki;a{c') r<5c,_Fijfcf.„c' 



n(c') 



'i[B(F,,fe«-„c')-£;(c')] ^ g^^,Q^lE{F,,M;o.c')^E(c')] 



-i[-B(i^.,7=f;oc')--E(c')] _,_ Q^lE(F,,ki:cc')-E{c')] 



where E{c) = H{c) + logn(c). We can also write this as 

Iijk£;a{c') <5c,_F,jfc£.„c' 



c^c' : W{c\c') 
c = c' : W{c'\c') 



E E- 

i<j<k<£ a<3 



n(c') 1 + e^'f=*i°-^('=') 



1 



W{c\c') 



(43) 

(44) 
(45) 



with 



^ijke-aE{c) — E{Fijkl-aC) — E{c) 



Aijki;aH{c) + log 1 + Aijki;an{c)/n{c) 



(46) 



Whether or not the entropic effects (the dependence of the mobility n{c) on the graph state c) remain 
important in controlling the evolution of large graphs, i.e. for iV — s- cx), will depend crucially on the 
degree distribution and the spectral properties of c. They can only be neglected if the relative changes 
Aijke-an'{c)/n{c) in the number of possible moves due to a single edge swap are always small. It is 
therefore important to evaluate the change in the state mobility after an edge swap. 

Only in the simplest case where all degrees are bounded, and the lowest moments of the degree 
distribution and of the eigenvalue spectrum q{X) of c remain finite for ^ cxd, we would get 



Aijke-aE{c) — Zi,yfe£.Q,i7(c) 



1 



Ajfe^;aTr(c4+2c3) + 0(iV-2) 



(47) 



However, for graphs with e.g. Z\yfc£;aTr(c^) = 0{N^) it is less clear when these entropic effects can 
be neglected. Imagine, for instance, a graph with a 'dense core', such as Cij = 1 — 6ij for i,j e 
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{1, . . . , K} and Cij — elsewhere. Here one has ki = if — 1 for z G {1, . . . , K} and fc^ = for 
i > K so n{c) = 0, since this specific state cannot be changed by any edge swap. In addition, 
g{X) = N-^[5{X-K+1) + {K-l)S{X+l) + {N-K)S{X)], which gives 

(fc)^ + l(fc-2fc^) =-i^[Tr(c^+2c^)-2^fc.c..fc.] = K{K-1){K^^- 3K+3) ^^^^ 

rs 

This shows that the 0{N'^) terms in n(c) cancel (in this example even the subsequent orders do), and 
as a result the numerator and denominator of Aijki-an{c) /n{c) are of the same order. Upon choosing 
K = 0{VN), for instance, one would have a finite nonzero average connectivity (fc) for N oo, but a 
diverging value of / dA X'^g{X) such that N~'^Tt{c'^) — 0(1). We must therefore allow for the possibility 
that the entropic contribution in (|46p will remain non-negligible if one allows for small deviations from 
the above 'dense core' graphs such as to render them mobile via edge swaps. 



3.4 Ergodicity 

The question of ergodicity for switching dynamics, i.e. whether any two graphs sharing a given degree 
sequence can be connected by a finite number of consecutive switchings, has been studied by Taylor [35] , 
where a formal proof by induction is given. Taylor's arguments focus on connected graph^, and this 
represents the harder case. Indeed, ergodicity for connected graphs would immediately transfer to 
generic graphs by a simple argument. Given a non-connected graph, add an auxiliary node connected 
to every other node. This extra node creates a new, connected graph. However the new links can 
never be switched with any of the pre-existing links. Hence they are not modified by the dynamics. 
Therefore, if ergodicity holds for the auxiliary connected graph, it is automatically valid also for the 
original non-connected graph. A more rigorous proof of the latter fact can be found in e.g. [36]. 



4 Invariant measure for edge swap dynamics with uniform acceptance probabilities 

Having established the precise connection between controlled stochastic switching dynamics and the 
resulting equilibrium measures, we can use this in two ways. First, we can construct for any desired 
equilibrium measure a canonical edge-swap Monte-Carlo process that will evolve towards it. Alterna- 
tively, we can investigate edge-swap processes used by others, and calculate the equilibrium states that 
would be generated. The simplest such processes are those where at each step a candidate edge swap 
is drawn randomly and uniformly and, when possible, executed. These are studied in this section. 



4.1 Relative errors in observables upon assuming incorrectly a uniform measure 

An edge swap graph shuffling process in which all randomly drawn candidate moves are accepted and 
executed is mathematically equivalent (apart from an overall time re-scaling) to our general Monte- 
Carlo process with transition probabilities (I44I45I46|) . if in our formulae we make the choice H{c) — 
— log n{c) . Since the general process ()44|45|46p has been constructed such as to evolve to the equilibrium 
measure Poo(c) = exp[— ii(c)] we know that the one where all randomly drawn allowed edge swaps 
are accepted, must evolve to the stationary measure Poo(c) = exp[logn(c)] = Z^^n{c), i.e. to 

Poo(c) = |{(fc)^+l((fc)-2(fc2)-l^fc,c,,A;,) +^Tr(c4 + 2c3)} (49) 

where Z is determined by the normalization condition X]cer2[k] Poo(^) ~ ^- '^^^ ^^'^^ express the 
actual equilibrium expectation values {G{c)) — J2c£n[u]P°°i^)^i^) '^^ graph observables in terms of 

^ Actually, in [35] it is shown (Theorems 3.1 and 3.2) that ergodicity holds for generic connected pseudo- 
graphs, a class of structures that allows for multiple links between edges and for loops of length 1, which is 
much broader than just the simple graphs we consider here. The present case is covered by Theorem 3.3 in [35]. 
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type A type B 



k 




I 

Fig. 1 Connected parts of 'nearly hard-core' graphs, as defined by a degree sequence (modulo node permu- 
tations) of the form: = K - \ iox i < K, ki = \ ior i ^ {K + \,K + 2}, and fci = for i > ii" 4- 2. 
In the present figure K — 10, and the nodes i > K + 2 with zero degree are not shown. Left: the graph of 
type A, with ck+i,k+2 = 1, of which there is only one; here n{c) — K{K — 1). Right: the graphs of type B, 
with ck+i,k+2 = 0, of which there are K{K — 1) (one for each choice of k and £); here n(c) — 2{K — 1). An 
equilibrated stochastic edge swap dynamics with randomly drawn candidate edge swaps and state-independent 
acceptance rates would visit the graph A with probability p(c) = 1/[1 -I- 2{K — 1)], and each of the K{K — 1) 
type B graphs with probability p(c) = 2/[K{l + 2{K - 1))]. 



what would have been found for a strictly flat measure, i.e. in terms of averages of the type {G{c))q = 
I^MI^^ J2cen[k\ ^(c)' where all graphs with the prescribed degree sequence k having equal weight: 

(G(c)) - (G(c))o _ Ecer2[k] «(c)G(c) i 



(G(c))o Ecer2[k]«(c) (G(c)>o 

(n(c)G(c))o-(n(c))o(G(c))o 



(n(c))o(G(c))o 

^"^^^ (cg);"^"" - (Tr(c^ + 2c3))o - 2 hk, [^^^ - (c,-)o] 
iV2(fc)2 + iV[(A:)-2(fc2)]] + (Tr(c4 + 2c3))o - 2^,^. fc,fc,(cy)o 



(50) 



According to ([5]), the correct dynamics leading to the uniform measure p(c) = |i7[k]| ^ would have 
corresponded to the following acceptance probabilities for proposed edge swaps c' — > c: 

A{c\c') ^n{c')/[n{c')+n{c)] (51) 

A sensitive marker of deviations from uniform sampling of graphs by poo (c) should be the observable 
n(c) itself ([38]) . for which one finds the relative error 

(n(c)) - (n(c))o _ {n\c))o - {n{c))l 



i(c))o (n(c)) 



2 > (52) 





with equality if and only if ?i(c) = n(c') for all c, c' G J7[k], i.e. if the measure Poo(c) is flat. Let us next 
acquire some intuition for the degree sequences k = (ki, . . . , fcjv) where the incorrect assumption of a 
uniform equilibrium measure over /?[k] would lead to significant errors in expectation values. These 
situations occur when the mobilities ?^(c) vary significantly from one graph to another. 



4.2 'Nearly hardcore' graphs 

As an example let us consider a choice for the degrees k = (fci, . . . , fc^r) that corresponds to a 'nearly 
hardcore' graph (see Fig. [T]), involving a fully connected core of size K and two degree-1 nodes (note 
that for a fully 'hardcore' graph with K ~ N there would not be any allowed edge swap): 



i<K: ki=K-l, i = K + l,K+2: fe, = 1, i>K + 2: h = (53) 
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Here we have p{k) — ^Sk^K-i + jfSk^i + ^ § "^ ^kfi, and hence formula (155)1 gives 
4n(c) = K'^{K-lf + bK{K-l) - 2K{K-lf + 2 + Tr(c'' + 2c^) 

K K 

^2{K-lf -4(if-l)^KK+i +Q,A-+2] -4cK+i,K+2 (54) 

ij — l 2—1 

There exist K{K—\) + 1 such graphs. Close inspection of their possible realizations reveals only two 
types, A and B, which are characterized by whether or not the two degree-1 nodes are connected to 
each other, see Figure [T] 

- Type A: 

c/<-+i,K+2 = 1, Cy = 1 V(i, j), i < j < ivT, = elsewhere (55) 

Here we have the two degree-1 nodes are connected to each other, plus one fully connected core of 
K nodes. There is just one such type-A graph c, which we will call ca- It allows only for edge swaps 
involving the two degree-1 nodes and any two nodes taken from the core, so n(c) = K{K — 1). The 
eigenvalue distribution of c is 

^,(A) ^ ^^(A-1) + §'5(A+1) + ^5{X~K + l) + ^~^~^ ^(A) (56) 

and hence formula (|54|) indeed reproduces the correct mobility n(c) = K{K — 1). 

- Type B: 

CK+i,K+2 = 0, 3(A;,£ < iiT) such that Cfc^^+i = q,a:+2 = 1, 

=1 V(i,j), i<]<K, i,H{kJ-} (57) 
Cij = elsewhere 

Here we have de facto carried out an edge swap relative to ca, and replaced both the link between 
the two degree-1 nodes and the link between two nodes {k, £) of the core by two new links: one from 
node k to degree-1 node K + 1, and one from node £ to the other degree-1 node K + 2. We will call 
the resulting graph CB-k,e- There are K{K— 1) such type-B graphs, one for each choice of (fc, £), but 
each of these allows only for 2{K — 1) edge swaps. In fact, the two outlying edges can be switched 
among each other in two ways, the one that brings us back to ca and the swap that is equivalent 
to replacing the links Ck.K+i and by the links and ci^k+i- In addition, each of the 

outlying edges can be swapped separately in one way with every edge that has a node in common 
with the other outlying edge (this is easily checked directly). There are K — 2 such edges. Hence 
7i(c) = 2 + 2{K — 2) = 2{K — 1) for each state of type B. The spectrum of each type-B state is 

g{X) = ^^(A-l) + ^d^A + l) + ^~q{\) + ^~^~^ J(A) (58) 

in which the normalized distribution ^(A) represents an eigenspace of dimension K with three 
further eigenvalues, to be solved from (A + 1)(A^ — 1) = {K — 2)(A^ + 2A — 1). 



We see explicitly that here the edge swap dynamics is indeed ergodic on the space of allowed graphs. It 
follows from the possible graphs c and from their associated mobility numbers n(c) as identified above, 
that for the present choice of degrees one will find under edge swap dynamics with state-independent 
acceptance probabilities the following equilibrium measure: 

1 2 ^ - 

^ 1 + 2(^-1)'^-- + K[l + 2{K^l)] ,5/''^"""'^ ^^^^ 

This measure is far from uniform. It will now depend crucially on which quantity one measures during 
the graph evolution process to which extent this probability inhomogeneity will manifest itself. If we 
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were to measure, for instance, the expectation value of the hnk ck+i,k+2 (which is only present in the 
state ca) we would find from the true equilibrium measure (|59p : 



{cK+i,K+2) = [1 + 2{K-1)]-^ (60) 
whereas for the uniform measure p{c) = |]7[k]|^^ = [K{K—1) + 1]^^ one would have found: 

(cK+i,A'+2)o = [l + KiK-l)]-^ (61) 

Expressions (|60p and ([61]) are identical only tor K = 2 (where the 'dense core' is indeed no longer 
present, and the graph reduces to two disconnected links), and differ significantly for increasing K: 

{cK+l,K+2) - {cK+l,K+2)o ^ {K - 2){K - 1) 

{cK+l,K+2)o l + 2{K~l) ^ ' 

Assuming (incorrectly) a uniform equilibrium measure on J7[k] would grossly underestimate the true 
likelihood to observe the bond ck+i,k+2- Similarly, if we calculate for both ((59l) (obtained for 'accept 
air implementation of randomly proposed edge swaps) and for the uniform measure on i7[k] (obtained 
for the correct acceptance rates ([?T|) ') the average mobilities {n{c)) and (ri(c))o, respectively, we find 

n c) = -i '-^ n(c) = — 63) 

\ V ;/ 2K-1 ' \ ^ ''/o K{K-l) + 1 ^ ^ 

Again these are identical only when K = 2, whereas {n{c)) > (n(c))o in all other cases. 

Degree statistics such as in the present example do not strike us as far-fetched. If we were to add 
to the present K+2 nodes another N—K—2 'dummy' nodes with degree zero, then our enlarged graph 
would have (fc) = [K{K-l) + 2]/N and (fc^) = [K{K -if + 2\/N . Choosing e.g. K = ^/^TV would 
then give a finite average connectivity [k) — 4> + 0{N^^/'^) and a diverging width, [k"^) = ©(iV^/^), 
identical to what one would have found for scale-free graphs with p{k) ~ Nothing in the first 

two moments of the degree distribution could therefore be regarded as severely pathological. 



4.3 Conditions under which the invariant measure will become uniform for iV — > oo 

It is clear from the above example that any results obtained from measuring graph observables numer- 
ically during edge swap dynamics with state-independent acceptance probabilities should be accompa- 
nied by an explicit proof that for the degree sequence under consideration and for the relevant graph 
size the equilibrium measure on the space of graphs with this given degree sequence can be taken as 
uniform. According to (|44l45l46p . our stochastic process will evolve towards a state with uniform graph 
probabilities on J7[k] (the set of graphs characterized by an imposed degree sequence) if it corresponds 
to H{c) = constant for all c € J7[k]. From this it follows that for — > oo the canonical edge swap 
acceptance probabilities lose their dependence on c as soon as 

lim A,,M;<. logn(c) = lim log [l + ^MM^^l = Q (64) 

N^oo ■' ' o w N^oo I n{c) J 

In appendix [B] wc prove bounds for n(c) and Aijki-an{c). In particular, wc may use (|95p and (j97p . viz. 

n(c) > ^N^{kf + ^N{k)-^N{k^)ikr^,^ + ^) (65) 
\A,jM;an{c)\ < ^N{k^)k^,^ (66) 



to establish 



\Aijke-an (c)| 2An , {k^){k max ' n / 

^ 1 ' = 777731 (67) 



n(c) - 1-2An' N{ky 
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This latter formula thus allows us to identify conditions under which the simple process with state- 
independent acceptance probabilities of randomly drawn candidate edge swaps will lead to an effec- 
tiveljS uniform measure. Condition ([64]) will be satisfied if /I at ^ 1, i.e. as soon as 

(fc2)fcmax/(fc)' « N (68) 

Clearly, degree sequences of regular random graphs, viz. p{k) = dk,k* for some finite connectivity k* , 
meet all the conditions for finding asymptotically a flat invariant measure. However, one has to be 
careful with scale-free degree sequences, where both (A:^) and fcmax diverge as N ^ <x. 

It is instructive to inspect condition (|68p for the 'nearly hardcore' graphs in the previous subsection, 
where p{k) = ^Sk.K-i + '^^k,i + ^^—^—-^^k.Oi and where for state-independent acceptance rates one 
does not have a flat invariant measure for any K > 2. Here (fc) = K{K—1)/N + 2/N, (fc^) = 
K{K''1)^/N + 2/N and fc^ax = K-1, so 

(fc^)fcmax ^ [K{K-l)^ + 2]{K-l) 

N{ky [K{K-l) + 2]^ ^ ' 

For all iiT > 2 the ratio ([69)) stays finite, so (|68|) is indeed violated. 

Note that in the previous example also K — 2 would violate ([68]) . yet in this case one would have 
found a flat measure even for state independent acceptance rates. It should therefore be emphasized 
that condition (j68p has been derived as sufficient for having effectively a uniform invariant measure. 
It is, however, not a necessary condition; there exist indeed degree sequences that violate (j68p but still 
give a flat invariant measure. This is easily seen upon inspecting simple examples such as the 'star-like' 
graphs. For instance, the graph characterized by the degrees ki = 1 for i < A'^ — 4, fcAr_4 = k^^^ = 
kN^2 = few-i = 2, and k^ = N — 1, corresponds to a central node N connected to N — 5 degree-1 
nodes and connected to two further loops of length three. This degree sequence has lim7v^oo(fc) = 2, 
but (fc^) = 0{N) and fc,nax = 0{N), and hence violates ([68| . Yet there are only three allowed graphs 
which each have n{c) = 2, so the invariant measure is flat. For some graphs it is in fact possible to 
sharpen the condition (j68p further, by using alternative bounds for the various terms in Aijki-an{c)] 
examples of these are derived at the end of appendix [Cl 



5 Generating random graphs with prescribed degree correlations via edge swaps 

Our second application of the general formalism is the construction of Monte-Carlo processes that 
evolve towards the equilibrium state corresponding to random graphs with prescribed degree sequences 
and controlled non-uniform measures. For instance, one could impose non-uniform measures in order 
to impose specific degree correlations, which is achieved by the ensemble studied in [37], 

Poo(c) = |n ^Q(fc-fcj)'5c.„i + (l-^Q(fc„A;,))5e,,,o l[^k^M<^) (70) 

i<j L J i 



with Q{k,k') > for all {k,k') and J2kk' Pi^)Pi^')Q(^'^') ~ ^- This ensemble gives the maximum 
entropy within the subspace of graphs with prescribed degrees and upon imposing as a constraint the 
average values II{k,k') — {II{k, k'\c)) of the relative degree correlations, where 77(fc,fc'|c) is defined 
as follows 

Kk'>0: njKk'lc) ^ ^^'^^/'^^'^'(f^'^'^^i^ ik){N-l) ^^^^ 

l^i^j"k,k,{c)Ok' ,kj{c) kk' 

n{k, k'\c) gives the probability that two randomly drawn nodes of c with degrees (fc, k') are found to 
be connected, divided by the probability that this would be true in random graphs drawn from p{c) = 

* By 'effectively uniform' we mean that the canonical edge swap acceptance probabilities + e^'J'='^;°^''^ 
in (|44p that would give us a rigorously uniform invariant measure become for N ^ ex indistinguishable from 
1/2 (i.e. from state-independent acceptance probabilities). This still does not rule out the possibility that 
certain pathological observables could be defined for which the asymptotically vanishing deviations from 1/2 
could add up to a non- vanishing effect. 
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,fc.(c) 



For the ensemble (iTOl) one finds for 



wliere F{k) is to be solved from 

Vfc > : Q{k, k')p{k')k'/F{k') = (fc)F(fc) 



that n{k,k') = Q{k,k')/F{k)F{k'), 



(72) 



Conversely, for each desired function II{k,k') one may always choose Q{k,k') — n{k, k')kk' / (k)^ in 
((70)) and find this ensemble subsequently generating graphs with the required degree correlations. 
See [38] for proofs of these statements, and for further mathematical properties of 7T(fc, k') and the 
ensemble (|70|) . 

The controlled non-uniform measure ([70| can be generated via stochastic processes as studied in 
the present paper. In the language of our processes (|44l45l46p . it simply corresponds to the choice 



H{c) = -J2log [^Q{k,,k,)S,,^., + (l-^Q{k„k,) 



(73) 



If we now work out the implications for ()44|45l46p of choosing ([73]) . we find this process describing 
the random drawing of candidate edge swaps Fijki-a, upon which the proposed transition c — > Cnow = 
Fijke-aC is then accepted (and executed) with the acceptance probability 



^(Cncw|c) — 



Tt(Cncw) gff(c„c„)--ff(c) 

n(c) 



(74) 



With the sets Sijkt,a introduced in (|29l30l3ip that specify which index pairs are affected by the proposed 
edge swap, we find for the function (|73p that 



,)-ff(c) ^ -Q 

(a,6)eSijfc£:Q 

= n 



M. 

N 



Q{ka,h)){l - Cab) 



^Q{ka, k.Kr + (i- #Q(fca, k,)) (1 - csr) 



LabSc 



KbSc 



(where we used the property Cab = 1 



for all (a, 5) e 5'yM;a), with 

Lab = N/[{k)Q{ka,kb)]-l 



(75) 



(76) 



If we start from a physically realizable function II{k, k') (see [38] for the precisely mathematical 
conditions for realizability) and if we use the canonical kernel Q{k,k') = n{k, k')kk' / {ky in our 
ensemble (such that for TV ^ oo it will generate graphs with relative degree correlations 7T(fc, k')), the 
latter parameters become Lab = N{k)/[II(k,k')kk'] — 1. 



6 Numerical tests 

We have conducted extensive numerical experiments on a variety of graphs to confirm the validity of 
formula ([55)1 for n(c) and found an impressive agreement. The calculation of the terms Tr(c'^) and 
Tr(c^) in n(c) at each time step, required in the calculation of the acceptance probabilities of the 
canonical Markov chain, is cpu-intensive. However, for finitely connected graphs working out these 
traces can generally still be done in 0{N) steps by efficient use of arrays with indices of the neighbours 
of each node, as opposed to brutal matrix multiplication. Alternatively one could calculate the traces 
only once at the start of the simulation, and update their values on-line by using formulas (|104ll05p 
in appendix [Cl 
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-1 



0.01 







100000 



200000 



300000 



400000 



500000 



executed moves 



Fig. 2 Re-scaled running av erages n{c)/N'^ of the graph mobiUty (|38p . measured during numerical simulations 
of Markov chains of the type (I44I45|) . The time unit is the number of executed edge swaps, and running averages 
are measured over time windows of 10,000 successive states c. All data refer to the nearly hardcore graphs 
shown in figure [T] with K — 98 and A'^ = 100. Full circles: observed graph mobility if all randomly proposed 
edge swaps that are possible are executed, i.e. when ^(c[c') — 1. Open circles: observed graph mobility if 
randomly proposed edge swaps that are possible are executed with the canonical acceptance rates (|5ip . i.e. 
when j4(c|c') = [1 + n{c)/n{c')]~^ . Dashed horizontal lines: the corresponding theoretical predictions (|63|l for 
the equihbrium mobilities, which for K = 98 give {n{c))/N^^ 0.0242 (upper) and {n{c))o/N^^ 0.0195. 

6.1 Accept-all edge svi^ap dynamics versus edge-swap dynamics vifith correct acceptance rates 

Our first simulations were carried out for the 'nearly hardcore' graphs of figure [1] We ran two different 
Monte-Carlo processes as described by (I44l45p . In the first we accepted all randomly generated possible 
edge swaps, i.e. A(c|c') = 1, whereas in the second process we used the canonical acceptance rates 
(I5T|) . i.e. A(c|c') = [1 + n(c)/n(c')]^^. This allowed us to verify the theoretical predictions that in the 
former process the system will evolve towards the non-uniform measure Poo(c) = ^^(c)/ X]c'ef2[k] "-(c') 
whereas in the latter 'the system evolves towards the flat measure (c) — |i7[k]|~^. This verification is 
easiest upon measuring time averages of the mobility n{c) itself, for which we have derived the (exact) 
expressions (|63| . In figure [2] we show for both processes the observed mobility time averages n(c) as 
measured over successive time windows of 10^ accepted moves (which gives us also information on when 
the system can be regarded as in equilibrium), together with the predicted equilibrium values (j63|) (as 
dashed horizontal lines). We conclude that in these graphs there is perfect agreement between theory 
and the simulations, and that indeed one cannot generally assume 'accept all' edge swap randomization 
to lead to an unbiased sampling of i7[k]. 

As our second synthetic example system, we choose as our starting point the undirected version of 
the graph studied in [16], shown in figure[31 to be called graph A (or c^). It consists of N nodes, two of 
which (labeled 1 and N) have degree N — 2, whereas the remaining iV — 2 nodes have two connections 
each. We note that n{cA) = {N — 2){N — 3), since it is possible to switch in one way every two links 
joining opposite sides of the central 'wall' provided they don't have a node in common (this mobility 
value can of course also be calculated from ([38|l ). Any possible edge swap executed on ca will bring 
us to a graph of type B, see right diagram in figure [31 of which there are {N — 2){N — 3). It follows 
that |i7[k]| = 1 + {N — 2){N — 3). Looking at Fig. [3l we see that to compute the mobility of type B 
graphs one has to take into account the facts that: (i) we can switch in one way the links such as (l,p) 
and {q,N), and (ii) the link {q,p) can be switched in one way with each of the links (l,i) and {j,N) 
such that i ^ {p, N} and j ^ {l,?}- There are — 4 such links in each group. Hence including the 
switch that brings us back to type A we have n{cB) = 1 + 2{N — 4) -I- 1 = 2{N — 3) (again, one could 
also have used formula l|55)) to find this result). If one carries out an 'accept all' edge swap process 
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N 



Type B 



Fig. 3 Graphs used in our second numerical experiment (the undirected versions of t he grap hs introduced 
in [16]). Left paneh the graph chosen as the initial state of the Monte-Carlo process (|44l45p (to be called 
type A), with K = N — 2 'central' nodes of degree 2 and two nodes of degree K. This configuration has 
n(c) = K{K — 1). A generic switching of a type A graph, e.g. one involving the four nodes {l,p, q, N}, leads 
to one of the K{K — 1) type B graphs, of which an example is shown in the right panel. Each of the type B 
graphs has n(c) = 2{K — 1). 
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Fig. 4 Distribution Q{f) re-scaled frequencies at which the accessible graphs are visited during switching 
dynamics of the type (|44I45[) over 2 • 10® time steps, starting from a type A graph of figure|3]with A'' = 10. Here 
|J?[k]| = 57. A uniform measure p(c) = |f2[k]|~^ would give Q{f) = S{f ~ 1). In the left panel, we see the results 
for an 'accept all' switching dynamics, viz. y4(c|c') — 1, with the arrow pointing to the peak corresponding to 
the time spent in the type A graph. In the right panel, the same quantity is reported for a switching dynamics 
with the canonical acceptance probabilities A(c|c') = [1 -f n(c)/n(c')]~^. 



starting from c^, our theory predicts that upon equilibration this would give the following measure on 
the space i7[k] of graphs with degrees sequences identical to that of c^: 



Poo(ca) 



n{cA) 



n{cA) + {N-2){N-3)n{cB) 2N - 



(77) 
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13.2 I 2 I 

200000 400000 600000 800000 1000000 200000 400000 600000 800000 1000000 

executed moves executed moves 

Fig. 5 Left: re-scale d runn ing averages n(c)/N'^ of the graph mobiUty (|38|l . measured during edge swap 
dynamics of the type (|44l45p in the human protein interaction network [39] with A*' — 9463 nodes and average 
connectivity (fe) « 7.402. The time unit is the number of executed edge swaps, and running averages are 
measured over time windows of about 3800 successive states c. Full circles: 'accept all' edge swap dynamics, 
i.e. j4(c|c') — 1. Open circles: randomly proposed and possible edge swaps are executed with the canonical 
acceptance rates (jSip . i.e. y4(c|c') = [l + n(c)/n(c')]~^. Right: corresponding measurements of Tr(c"^)/A'^, which 
is proportional to the average number of length-3 loops per node in the network. 



and for each type B graph 



n(cB) 



(78) 



n{cA) + iN-2){N-3)n{cB) {N-2){2N-5) 
Hence, if we measure during the 'accept all' edge swap process the relative frequency graph distribution 



Qif) 



J— J2 5[f-\m\poo{c 



\f2[k]\ 



(79) 



cen[k] 



we should find 



Qif) 



l + {N-2){N-3) 



f 



l + (iV-2)(7V-3)i {N-2){N-3) 



2N~5 



l + {N-2){N-3) 



f 



2[l + {N-2){N-3)] 
{N-2){2N-5) 



(80) 

(as opposed to the result Q{f) = 5{f — 1) that would be obtained for a flat measure on i7[k]). We have 
carried out numerical simulations of the edge swap dynamics (|44|45p . first on an 'accept all moves' basis 
and second using the canonical acceptance probabilities A(c|c') = [1 + n(c)/n(c')]~^, for graphs as in 
Figure[3]with TV — 10. Here one expects to find Q{f) = 5{f — 1) for canonical acceptance probabilities, 
but for the 'accept all' edge swap dynamics we should get: 
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(81) 



The results are shown in figure 21 The distribution Poo(c) generated by the 'accept all' edge swap 
dynamics is indeed not uniform: the smaller peak in Qif) appearing at the predicted value / = 
19/5 = 3.8 reflects the visits to the type A graph, which occur more frequently due to its larger 
mobility. Conversely, with the canonical acceptance probability A(c|c') = n(c')/[n(c) + n{c')] (which 
here takes the three possible values A{ca\cb) — 0.2, A{cb\cb) — 0.5, and A{cb\ca) = 0.8) the 
resulting equilibrium measure poo(c) of the process is indeed flat, i.e. Qif) = 6{f — 1). 



In order to assess to what extent the differences between 'accept all' and correct edge swap dynamics 
manifest themselves in large and realistic graphs as studied intensively in biology and bio- informatics. 
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we have also carried out edge swap siraulations (similar to those described above for synthetic graphs) 
in protein interaction networks. We chose the most recent data for the human protein interaction 
network [39], giving a graph of = 9463 nodes and degree statistics (fc) « 7.402 and (fc^) « 248.7 
(with fcniax = 247). For this data set the simple condition (|68p does not yet permit us to rely on 'accept 
air edge swap randomization as a safe algorithm for unbiased randomization, since here one finds 

(fc')fcmax/(fc)'iV « 0.12 (82) 

As with the synthetic graphs, we ran two different Monte-Carlo processes as described by (j44l45|) : 
'accepted all' edge swaps dynamics, i.e. A(c|c') = 1, and edge swap dynamics with acceptance rates 
((5T|l . i.e. A(c|c') = [1 + n(c)/n(c')]~^ (to guarantee a uniform equilibrium measure). The results are 
shown in figureO Here the differences between the two types of dynamics are seen to be negligible, both 
in terms of the observed mobility n{c) and in terms of quantities such as Tr(c'^) (which is equivalent 
to counting the number of length-3 loops in the network). One finds similar results for the available 
protein interaction data of other organisms. This is a relevant observation, since graph randomization 
via 'accept all' edge swapping has in the past been used to quantify the relative frequency of small 
network modules of 'motifs' in biological networks, of which length-3 loops are just a primitive example, 
or to quantify the relevance of observed degree-degree correlations; see e.g. [40-42]. It follows that those 
who have in the past used 'accept all' edge swap dynamics to randomize protein interaction networks 
have been fortunate, in that for the available data the incorrect sampling resulting from this dynamics 
does not appear to interfere with observation. However, since it is now generally agreed that the 
presently available incomplete protein interaction data are biased samples of the full proteome, there 
is no guarantee that when data have become more complete and representative the simple but in 
principle incorrect 'accept all' edge swap randomization will continue to work in practice. 



6.2 Simulations to produce controlled non-uniform measures 



Our final simulations involved Markov chains of the form (|44l45|46p tailored to evolve towards con- 
trolled non-uniform equilibrium measures. Here we tested the prediction that the edge swap process 
with edge swap acceptance rates defined by (j74l75l76|) will evolve towards the nontrivial measure 
([70)1 . To test this, we measured the relative degree correlations n{k,k'\c) as defined in (|7ip upon 
equilibrating the edge swap dynamics (I44|45|46|) with acceptance rates (|74I75I76|) . and compared the 
result with the theoretical prediction extracted from ([70| . For the kernel Q{k,k') in (|70p we took 
Q{k,k') = C~^{k — k'Y, with normalization dictating that C ~ 2((A;^) — (fc)^). For sufficiently large 
N, and given that the measure is indeed ([70|l . the predicted values for n{k, k') are 



n{k, k') ^ C-\k - k'f /[F{k)F{k')] 

where F(k) is to be solved from ([72]) . which here gives F{k) — (as — 2a2^ 
the three coefficients at are to be solved numerically from 



E 



k^p{k) 



as — 2a2k -t- aik^ 



(83) 



ai 



k'^)/y/(k)C, in which 



(84) 



The predicted relative degree correlations are then given by 



n{k,k') 



(k){k~k') 



[as — 2a2k + aik^][a3 — 2a2k' + aifc'^] 



(85) 



We generated a simple synthetic initial graph Cq with N = 4000 and (fc) — 5, with the non-Poissonian 
degree distribution shown in figure [6] (top left). Its relative degree correlations n{k, k'\co) were found 
to be all close to one (being the value for all n{k, k') that one would have found for the flat ensemble 
p(c) — Z~^Yli^ki,ki{c))! see figure |6] top right. After iterating the Markov chain (|44I45I46|) . with 
canonical acceptance rates (|74l75l76p tailored to approaching (j70p as an equilibrium measure, and 
upon reaching what appeared to be equilibration (after 75P00 accepted edge swaps), we found indeed 
values for the degree correlations that were in very good agreement with those corresponding to the 
nontrivial target ensemble ([70]) (shown in the bottom panels of figure [6]). Note that perfect agreement 
is expected only for N ^ oo. 
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n{k,k'\co) 




Fig. 6 Results of edge-s wap simulations tailored to generating equilibrium random graph ensembles with 
the non-uniform measure (|70|l . Top left: degree distribution of the graph, with — 4000 and (fc) — 5. Top 
right: colour plot of the relative degree correlations 7T(fc,fc'|co) as measured in the (randomly generated) 
initial graph cq. Bottom left: colour plot of the predicted relative degree correlations (|85|l . corresponding to 
Q{k,k') = (k-k'f/C, which is found for iV — > oo in the ensemble ((70} (the target of our edge swap process) . 
Bottom right: colour plot of the relative degree co rrelations 77(fc, fc'|cflnai) in the final graph Cflnai, mea sured 
after 75P00 accepted moves of the Markov chain (|44l45l46p with canonical acceptance rates (|74I75I76[) . The 
latter is indeed in good agreement with the prediction (bottom left) corresponding to the target measure (fTO]). 



7 Conclusions 

The mathematical analysis of stochastic processes occurring in the space of graphs with prescribed 
properties presents a number of intriguing and challenging features, that have just started to be ad- 
dressed in the language of statistical mechanics. From a physical viewpoint, understanding graph dy- 
namics is important in particular when seeking to discern the basic features of graphs (the constraints) 
from the ones that are induced by them, which generically characterize the ensemble of graphs satis- 
fying the constraints. It turns out that the mobility, namely the number of allowed moves away from 
a given state, is a central quantity for unraveling several properties specific of such processes. In par- 
ticular, the very dependence of the mobility on the specific state (which in principle can persist even 
in an infinite graph) is responsible for important entropic effects that may prevent a simple switching 
dynamics (simple in the sense that all randomly generated and possible moves are executed) from sam- 
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pling uniformly the space of graphs compatible with given constraints. Uniform sampling is especially 
desirable in applications where one is cither interested in testing the robustness of certain graphical 
properties against graph "randomization" or where one aims to generate random graphs that satisfy 
a given set of constraints but are otherwise unbiased. 

In this paper wc have constructed a general framework for constrained stochastic graph dynamics, 
and derived an explicit and relatively simple expression for the graph mobility in the case where the 
dynamics is based on randomly generated 'edge swaps' or 'switchings'. This latter expression allowed 
us to: (i) define Monte-Carlo processes that are guaranteed to converge to any desired measure on the 
space of graphs with a prescribed degree sequence, (ii) calculate explicitly the equilibrium measure 
that would be found for 'accept all' edge swap dynamics (which will generally not be uniform), and 
(iii) identify practical conditions on the graph topology that are sufficient to ensure that even the 
equilibrated simple 'accept all' edge swap dynamics will give a uniform dynamical sampling of the 
accessible graphs in the limit where the number of nodes N ^ oo. 

We have carried out Monte-Carlo simulations of both synthetic graphs and of biological networks 
describing protein-protein interactions, in which we compared the results of executing 'accept all' edge 
swap dynamics versus edge swap dynamics with correct acceptance probabilities tailored to producing 
unbiased equilibrium measures. Wc also carried out simulations of stochastic edge swap dynamics that 
are tailored to generating random graph ensembles with controlled non-uniform measures, characterized 
by nontrivial imposed degree correlations. All numerical simulations lend convincing support to our 
theoretical predictions, and underline the importance of the graph mobility in the construction of 
correct Monte-Carlo acceptance probabilities in constrained graph dynamics. 

We have taken the approach of formulating stochastic graph dynamics within a general statistical 
mechanics framework, where the constrained Markov dynamics of graphs is treated similarly to a spin- 
flip dynamics in magnetic systems (albeit that in the latter the issue of state mobility does not arise, 
not even in the case of constrained Kawasaki- type dynamics [1]). This suggests that future studies of 
graph dynamics may well reveal a rich (and possibly unexpected) phenomenology paralleling that of 
magnetic spin systems. 

Acknowledgements It is our pleasure to thank M. Marsili and F. Ricci Tcrscnghi for thoughtful comments 
and suggestions, and L. Fernandes for kindly providing us with protein interaction network data. 
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A Edge swaps as minimal degree-preserving moves in graphs 

For the benefit of tlie reader, we review in tfiis appendix briefiy in tlie language of tlie present study the 
arguments leading to the identification of edge swaps as the minimal degree preserving moves. Let us first 
characterize all possible moves _F : c ^ c' 7^ c that leave the degrees ki (c) —"^ ■ Cij of all A'^ nodes in a graph 

invariant. Since dj € {0, 1} for all {i,j), and Cij = Cji, each move F is characterized fully by specifying the 
set Sf = {(*, j)| i < j, Fcij = 1 — Cij} of node pairs that are affected by F. If all degrees are conserved, then 
also the average connectivity is conserved, so each link removed by F must be compensated elsewhere in the 
graph by a link created. Hence \Sf\ is even. We define [a,b] — {a,b) if a < fe and [a,b] — {b,a) if fe < a, and 
SF,i = G Sf} (i.e. the set of those nodes j that share with i a link that is either removed or created 

by the move F). Our first question is then: for which sets Sf of node pairs will the collective inversion of all 
links Cij — > 1 — Cij preserve all degrees, i.e. obey 

1 11 

For each node i, there is an even number of pairs affected by F, of which half see a link removed 

{cij = 1 — > Fcij — 0) and half see a link created {cij = — > Fcij = 1). Hence also \SF,i\ must be even for each 
i with bonds c rea ted or removed by F. Since It is clear that whether or not a set Sf (i.e. a move F) meets the 
N conditions (I86|) must depend on t he g raph c at hand. 

The minimal moves that satisfy (|86|l are defined as those involving the smallest set Sf ■ Let i be a site with 
bonds created or removed by F. Since \SF,i\ is even, there will be at least two further nodes {j, k) with bonds 
created or removed by F. Each of these must have their own sets Sfj and SF,k of even size, so it impossible 
for the the action of F to be restricted to the node pairs [i,j] and [i,k] alone. Thus the smallest possible size 
for Sf (which we know to be even) is \Sf\ = 4. Let us inspect the properties of moves with \Sf\ = 4 in detail: 

(i) Each node i involved in F has \SF.i\ = 2, i.e. participates in precisely two of the four node pairs in Sf- 
Proof: 

According to (|86|) each such i participates in at least two of the four node pairs in Sf- If we had \SF,i\ > 2, 
then there would be at least four other nodes involved in F (since \SF,i \ is even), each in turn participating 
in at least two pairs in Sf- The minimal set Sf would then contain Sf ~ [i, k], [i,£], [i,m], [fc,*], . . .} 

(where ★ is some node not equal to i or fc), which contradicts |Sf| ~ 4. 

(ii) F involves exactly four distinct nodes {i,j,k,£), and Sf ~ {[hj], [hk], [k,£]}. 
Proof: 

Starting from any involved node i, with Sp.i = {j, k}, we know that Sf contains Sf = {[i,j],[i,k],[k,-k], . . .}, 
where ★ is a node such that SF.k = {i, *} (so * ^ {i, k}). The first possibility is * = j. But if this were the 
case then either Sf = {[hj], [hk], [k, j]} (contradicting \Sf = 4|), or Sf = {[«, j], [i,k], [k, j], [£,m], . . .} for 
some new indices {i,m). In the latter case, however, since [SF,t[ ~ [SF.m[ ~ 2, the set Sf must contain 
additional node pairs involving £ and m, giving [Sf[ > 4. It follows that * — j is not allowed. This leaves 
us with Sf = [h k], [k,£], [j,£]} for some £ ^ {i,j, k} as the only option. 

Since we know that F acts as dj — » 1 — dj for all (i, j) G Sf, and that before the move exactly half of the 
pairs (i,j) G Sf have dj — 1, the minimal moves are (modulo node permutations) of the form of so-called 
'edge swaps' as shown below (where thick lines indicate Cab = 1 and thin lines indicate Cab = 0): 





k 



It is clear that these transitions preserve all degrees of a graph. We now know also that these are the simplest 
nontrivial transitions with this property. It is fairly straightforward to generalize the above representation, 
and show that each allowed move F with [Sf[ = rn corresponds to a set Sf that describes a closed path 
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«2 



ii) connecting m nodes, such that all d^ 



(with £ mod m). A path is 



allowed to cross itself, with ^I^f.^I — 1 giving the number of crossings at node i, but is not allowed to have 



overlapping segments. The action of F is then the inversion dj —* 1 



of all bond variables along the path. 
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B Upper and lower bounds for the graph mobility n(c) 

Here we establish simple bounds on the quantity n(c) defined in (|38p . expressed solely in terms of the degree 
moments (fc) and {k^) and the maximum degree femax ~ maxi ki. First we inspect the term with Tr(c"^). The 
only possible general lower bound is the trivial Tr(c^) > 0, since this is satisfied by all tree-like graphs with 
arbitrary degree distributions. To obtain an upper bound we use the inequality CjkCki < ^[cjk + Cki]'- 

ijk ijk i 

Next we turn to Tr(c''). A suitable lower bound can be constructed as follows: 

Tr(c'') = ^ CijCjkCktcei > ^ CijCjkCktCu&ji = ^ CijCjk = ^ = N{k'^) (88) 

ijki ijk£ ijk j 

An upper bound follows from CijCkt < ^{cij + Cki): 

Tr(c*) < ^ ^[cij + Cki\CjkCti = ^ kiCtjkj (89) 



ijke 

It follows from the four bounds constructed so far that 



(90) 



iiV(fc^) < iTr(c^) + iTr(c^) < ^N{k') + i ^ fe,c,,fc, 

Inserting into formula (|38|l subsequently gives 

n(c) > jN^{kf + iiV(fc) - iiV(fc^> - i ^ fc,c,,fc, (91) 

n{c) < jN^ikf + jN{k) ~jYl (92) 
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To proceed we need bounds for the term "^^^ kiCijkj. A simple lower bound follows from the fact that if dj = 1 
then ki > 1. An upper bound follows from ki < fcmax, and so we get 

J2 k^c^Jk, > J2 c^.fc. = E N{k^) (93) 

ij ij j 

^ ^ kiCijkj ^ kiYisLVi ^ ^ Cijkj = NkinEix{k ^ (94) 

This then leads to the following remarkably tight bounds for the mobility (note that always {k'^) > (fc)): 

n(c) > jN^{kf + iiV(fc) - iiV(fc'>(2fcn,ax + 1) (95) 

n{c)<^N'{kf + ^N{k)-^N{k') (96) 

A further corollary from this is an absolute bound on the mobility change Aijke:an{c) = Fijki-anic) — n(c) 
due to a single edge swap: 

\Aijkt;<.n{c)\ < ^N{k^)k^^^ (97) 



26 



C Effect of single edge swaps on Tr(c^) and Tr(c'*) 



Here we study the terms in (|38p that involve traces. We limit ourselves to states c on which the edge swap 
operator Fijki-a can act, since only those are required in (146 |l . First we introduce some further notation. We 
define [a, b] = (a, 6) if a < 6 and [a, b] = {b, a) ii b < a, and we denote the relevant sets of index pairs as follows: 

Spqv ^ {[p,q],[q,v],[v,p]}, Spgv;ijke;a = Spqv r\ Sijke;a (98) 

Spqvw = {[P, q], [q, V], [v, w], [w,p]}, Spqvw,ijkl;a = Spqvw H SijM;a (99) 

with in the first line p ^ q, q ^ v and v ^ p, and in the second line p ^ q, q ^ v, v ^ w, and w ^ p. 

Also, we recall that always i < j < k < £, and we associate to every set of index pairs Sijkt;a a closed 

path Vl'^ki.^ with [a, b] £ Sijkt, starting from a, along the 4 possible bonds through each pair of indices in the 
set Sijki;a, and passing through the link [a,b] in the order a to b. Index pairs with one index in common are 
visited sequentially. Hence, e.g. pj*'^,'.]^ is uniquely determined as the closed path i^j^k^l—*i. Finally, 



we indicate by &[^.'*' and a'^.'^' the index that follows b, and the index that preceeds a, respectively, along the 

and 



M 



closed path V'f.'^l^. 



For instance, j'^'.-j' = k and i^^'.-'^' = by periodicity (note that by definition b — 



[a,h] 



C.l Full expressions for Z\ijM;QTr(c^) and Z\ijfe£;Q,Tr(c'*) 

We recall that the edge swap operator Fijki-a can only affect the presence or absence of bonds in the set Sijki^a 
(where |iSijfc£;Q,| = 4), and that its action is always to create two new bonds and destroy two present ones. 
Hence 



AifcC;c«Tr(c^) = X^X^^I 



I Spqv;ijki;o 



ijfc£;ci (C[p^g] C|"q_^j C|"^^, 



n— 1 pqv 
4 



n— 1 pqvw 



(100) 
(101) 



In fact, it turns out that all terms with n > 1 must be zero: 



For each n = 2 term there are two index pairs {x, y) £ {[p, g], [q, v\, [v,pW in the case of Tr(c ) and two index 



pairs {x,y) € {[p,g], [q,v\, [v,w\, [w,pW in the case of Tr(c ) such that Fijki-.aCxy 



= 1 



If both pairs 



have Cxy = 1, or both pairs have Cxy = 0, then the conditions (I24I25I26|I for Fijki;a to act dictate that these 
two pairs have no indices in common. So the options for the action of Fijki-a are (modulo permutations): 

Tr(c^) 



Tr(c*) 





^[q,v] 


5 ^[v 


p,} = {1,1,0} ^ {1,0,1} 








C[q,v] 


■> ^[v 


p,} = {1,0,0} ^ {0,1,0} 








^[q,v] 


■> ^[v 


w] ! C^T^^p] } {1, 1, 1, 1} ^ 


{1,0,1,0} 






C[q,v] 


, C[v 


111] , C[„,p]} = {1, 1, 1, 0} 


{0,1,0,0} 


or {1,1,0,1} 


{^[p,q] 


^[q,v] 


jC[v 


w],Clw,p]} = {1,0,1,0} 


{0,0,0,0} 


or {1,1,0,0} or {1,1,1,1} 


{'^[p,q] 


^[q,v] 


■> ^[v 


w],C[w,p]} = {1,0,0,0} -» 


{1,1,0,1} 


or {0, 1,0,0} 


{^[p,(i] 


^[q,v] 


5 ^[v 


w],ciw,p]} = {0,0,0,0} 


{1,0,1,0} 





All but two cases have at least one Cxy — before the swap and at least one Cxy = after the swap; 
for those we know immediately that Aijke;a{cip^q]C[q^v]C[v,p]) = Aijke;a{c[p^q]C[q^v]Civ,w]C[w,p]) = 0. Only two 
moves in the list remain to be investigated: {1, 1, 1, 1} — » {1, 0, 1, 0} and {1, 0, 1, 0} — » {1, 1, 1, 1}. These 
last two moves involve all indices in Spqvw but are both incompatible with the action of any edge swap. 

For each n = 3 term, there are three index pairs {x, y) £ {[p, g], [g, u], [t',p]} in the case of Tr(c'^) and three 
index pairs {x,y) € {[p, g], [g, w], [v^w], [w,p]} in the case of Tr(c"') such that FijkUaCxy = 1 — Cxy. Since 
Fijki;a removes two bonds and adds two new ones, of these pairs {x,y) at least one must have Cxy = and 
at least one must have Cxy = 1. So the options for the action of Fijki-a are (modulo permutations): 

Tr(c«) 



Tr(c* 



{.^[p,q\ 


^[q,v] 


5 ^[v 


p,} = {1,1,0} ^ {0,0,1} 




{^^[pM 


^[q,v\ 


■> ^[v 


p,} = {1,0,0} ^ {0,1,1} 




{^^[pM 


^[q,v\ 


1 ^[v 


w],C[w,p\} = {1, 1, 1,0} 


{1,0,0,1} 




^[q,v\ 




tu],C[„,p]} = {1, 1,0,0} ^ 


{1,0,1,1} 


{^[p,<i\ 


^[q,v\ 


5 ^[v 


w],C[w,p]} = {1,0,0,0} -> 


{0,1,1,0} 
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In all cases one has at least one Cxy = before the swap and at least one c^y = after the swap, hence all 
n = 3 terms have Aijki;a{cip,q]C[g_^^C[y ^p]) = Aijke;c,{c[p^q]C[q^^^C[^^^]C[^^p]) — 0. 

— For each n = 4 term, which occur only in Tr(c''), all four index pairs {x, y) £ {[p, q], [q, v], [v, w], [w,p]} refer 
to links mapped according to Fijki-aCxy = 1 — Cxy. Since Fijki-a removes two bonds and adds two new ones, 
we must have C[p.g] + C[q.„] + C[„^u,] + C[^^p] = 2 both before and after the swap, so C[p^q]C[g_i,]C[j,_^]C[„^p] = 
Fijke:a{cip^q]C[q^y]C[y^^]C[^^p]) = and hence all n = 4 terms have Ziijfc£;c,(c[p,q]C[,,„]C[„_„]C[„_p]) = 0. 

It follows therefore that we may write, with the short-hands A/B — {x £ A\ x ^ B} and Cxy = 1 — Cxy, 

Aijke;aTr{c'^) = X]'5|s_p,„.i3fcf.„|,i^»jfc«;<:'(c[p,<j]C[9,ulC[t.,pl) 

pqv 

= 5Z^|Sp,.,.,fci!;cl,l( n C:,«)AjW;c( ^-«) 

=E'5is.,.....,»m( n ^^y){ n ^'-y- n ^^^^^ 

and 

= I] '^isp,„„,,.,,„i.i( n n n 

(3:>H)SSp,^„/Sp,t,„;ijfcf;Q (a:.H)eSp,„„.ijfc£,Q (a:,!/) £ Sp,„„. ijfcf 

(103) 

The above expressions can be simplified once more, taking into account the number of ways we can select the 
common index pair {a,b) from Spqv or Spq^^, respectively, and the fact that each such pair will be picked up 
twice (as [a,b] and [b,a], respectively) in the above summations, due to Cab ~ c^a- 

AjM;aTr(c^) = 2 ■ 3 E {l-2Cab) ^ CbvCva ^ & ^ (l-2Ca(,) ^ CbvCva 

(104) 

( \ 

Ajfcf;aTr(c'*) = 2 ■ 4 ^ (l-2Cai,) ^ ^ C6„C™C™a (105) 

[a,6]eSi3fcf.„ „^{a,b,bl;';^l} \ti,^{a,b,a'j';^'},(«,™)7^{ab^',6';';^') / 

The constraints over the sums in (|104|l and p05|l implement the (5s in (|102p and p03|) . and guarantee that no 
link other than [a, h] can be picked up from Sijki;a (note that (6, dL ^) and (a, &[j_.Q,^) ^ Sijk£;a Vq). 

We can verify briefly our results (|104I105|I for the nearly hardcore graphs, where the variation in the mobility 
term can be calculated explicitly. Referring back to Figure [T] the difference between the mobilities of a B type 
graph and graph A is 

Aijke:c.n{c) = nsic) ~ 71^(0) ^ - K - 2K + 2 ^ K'^ - 3K + 2 (106) 

If we identify sites K + 1 = j and K + 2 = i, we see that graph A is obtained from graph B by application of 
the edge swap Fjjke a- Hence, the variation in the mobility produced by the application of Fijki-i to graph B 
must be equal, via (|38|l . to 

Aijki;in{c) — -Ziijfc£;iTr(c'*) + -Ziijfc£;iTr(c^) — - ^ k^kwAijkt,iCvw (107) 

vw 

with Ziijfc£;iTr(c'*) and Z\ijfc£:iTr(c^) given by (fT04|l and (fTOS]) for a = l. 

For the nearly hardcore case one can see that the only link [a, b] € Sijki;\ contributing to the sums in (|104|l 
and p05|l is [k,£\, because if u ^ {a,^^.'^'} then Cbv = for 6 G {*, j}, and if / 6 then c^a = for a = j; so 
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the links [a, b] G {[i, j], [j, k], [£, i]} never contribute. Using Cki = 0, we have 



1 ^ 1 / 

-2. -(7^-1) ^ ^A,w;ic.„-2(^"^)M 

u6{z,j} li.' — 1 \{^'/ 



= 2(if - 2){K - 3) + 3(A: - 2) - 1 + 2(_ft: - 1) - (A' - 1)^ = - S-R' + 2 (108) 

where in the last equality we used the fact that Aijke;iCvw = 1 — 2c„„ if (u, to) G Sijkt, with Zlijfcf;ic„„ = 
otherwise, as well as Cjk = ca = 1 and Cij — 0. 



C.2 Bounds for Aijke-aT'^{c^) and Z\ijfe^:ctTr(c^) 

It is easy to construct bounds from (|104ll05p . based on the property 1 — 2caf, = ±1, on the fact that always 
precisely two of the four bonds in Sijki-a are zero, and on the inequality xy < ^{x + y) for x,y £ {0, 1}: 

\Aijke:aT'c{c^)\ < 12maxy^ Cai,C„5 < 6maxy^(Cai, + C^t) < 12fcniax (109) 
a^b ^ — ' a^b ^ — ^ 

V V 

I Ajfc^;aTr(c'^)| < 16 max CavCvwC^h <8maX } Cvw{Cav + C-^b) 

a^h ^ — ^ a^b ^ — ^ 

V .w^{a.b^ V .w^{a.b^ 

< 16max (^^Cijfcj^ (110) 

3 

with fcmax = maxi ki. We can finally simplify pi0|l in two ways. First, we may use kj < maxj kj, which gives 
maxi[X]j Cijfcj] < fcniax- Second, we could simply put max^X^j <^ijkj] < kj] = N{k). The result is: 

|A,fc^;aTr(c*)| < 16min{fci^,iV{fc)} (111) 



